AccentBox: Towards High-Fidelity Zero-Shot Accent Generation
Authors: Jinzuomu Zhong, Korin Richmond, Zhiba Su, Siqi Sun
Paper | Arxiv | Demo [Here] | Code
| Reference Speech | Reference Text |
| This is a very common type of bow, one showing mainly red and yellow, with little or no green or blue. |
| Target Text | VALL-E X Generation | GenAID Prediction |
| 1. Well, here's a story for you. | Australian | |
| 2. Sarah Perry was a veterinary nurse who had been working daily at an old zoo in a deserted district of the territory. | Australian | |
| 3. So, she was very happy to start a new job at a superb private practice in North Square near the Duke Street Tower. | American | |
| 4. That area was much nearer for her and more to her liking. | English | |
| 5. Even so, on her first morning, she felt stressed. | English | |
| Reference Speech | Reference Text |
| This is a very common type of bow, one showing mainly red and yellow, with little or no green or blue. |
| Target Text | VALL-E X Generation | GenAID Prediction |
| 1. Well, here's a story for you. | Australian | |
| 2. Sarah Perry was a veterinary nurse who had been working daily at an old zoo in a deserted district of the territory. | Canadian | |
| 3. So, she was very happy to start a new job at a superb private practice in North Square near the Duke Street Tower. | American | |
| 4. That area was much nearer for her and more to her liking. | American | |
| 5. Even so, on her first morning, she felt stressed. | Canadian | |






| Reference Speech (Speaker & Accent) | Note |
| speaker: p225, accent: English |
| Stimuli | Baseline | Accent_ID | Proposed |
| #1 | | | |
| #2 | | | |
| #3 | | | |
| #4 | | | |
| #5 | | | |
| Reference Speech (Speaker & Accent) | Note |
| speaker: p294, accent: American |
| Stimuli | Baseline | Accent_ID | Proposed |
| #1 | | | |
| #2 | | | |
| #3 | | | |
| #4 | | | |
| #5 | | | |
| Reference Speech (Speaker & Accent) | Note |
| speaker: p245, accent: Irish |
| Stimuli | Baseline | Accent_ID | Proposed |
| #1 | | | |
| #2 | | | |
| #3 | | | |
| #4 | | | |
| #5 | | | |
| Reference Speech (Speaker) | Note |
| speaker: p225, accent: English |
| Reference Speech (Accent) | Note |
| speaker: p294, accent: American |
| Stimuli | Accent_ID | Proposed |
| #1 | | |
| #2 | | |
| #3 | | |
| #4 | | |
| #5 | | |
| Reference Speech (Accent) | Note |
| speaker: p245, accent: Irish |
| Stimuli | Accent_ID | Proposed |
| #1 | | |
| #2 | | |
| #3 | | |
| #4 | | |
| #5 | | |
| Reference Speech (Speaker & Accent) | Note |
| speaker: p335, accent: New Zealand |
| Stimuli | Baseline | Proposed |
| #1 | | |
| #2 | | |
| #3 | | |
| #4 | | |
| #5 | | |
| Reference Speech (Speaker & Accent) | Note |
| speaker: p253, accent: Welsh |
| Stimuli | Baseline | Proposed |
| #1 | | |
| #2 | | |
| #3 | | |
| #4 | | |
| #5 | | |
